-
Notifications
You must be signed in to change notification settings - Fork 211
Support huggingface popular weight format for weight-only quantization #1580
Conversation
Signed-off-by: Cheng Penghui <[email protected]>
⚡ Required checks status: All passing 🟢Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🟢 Optimize Unit Test workflow
These checks are required after the changes to 🟢 NeuralChat Unit Test
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
depend on ipex gpu xetla kenel ready. |
Signed-off-by: Cheng Penghui <[email protected]>
Signed-off-by: zhenwei-intel <[email protected]>
Type of Change
feature
No API changed
Description
Support huggingface woq model format for intel GPU
Expected Behavior & Potential Risk
support AutoGPTQ model on huggingface models hub for WOQ on intel GPU